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Abstract 

Target identification is a critical step in the lengthy and expensive process of drug development. Here, we describe 
a genome-wide screening platform that uses systematic overexpression of pooled human ORFs to understand 
drug mode-of-action and resistance mechanisms. We first calibrated our screen with the well-characterized drug 
methotrexate. We then identified new genes involved in the bioactivity of diverse drugs including antineoplastic 
agents and biologically active molecules. Finally, we focused on the transcription factor RH0XF2 whose overexpression 
conferred resistance to DNA damaging agents. This approach represents an orthogonal method for functional 
screening and, to our knowledge, has never been reported before. 



Background 

Biological systems tend to remain phenotypically stable in 
the face of environmental challenges and genetic changes 
[1]. As such, genetic perturbation has become an efficient 
technique to dissect cellular functions. In this study, we 
used the concept of modulating gene dosage in human 
cells to gain insight into drug mode of action. Understand- 
ing the primary mechanism of action as well as the poten- 
tial polypharmacological effects of a drug can provide 
insight into how to reduce detrimental side effects, un- 
cover new applications for novel indication and explain re- 
sistance mechanisms [2-4]. 

Drug resistance in malignant tissues can be categorized 
into three main mechanisms: (i) drug distribution/metabol- 
ism (pharmacokinetics), (ii) heterogeneity of cancer cells 
and (iii) tumor micro-environment [5,6]. Among the other 
cellular mechanisms, gene overexpression (for example, by 
amplification of the drug target) can titrate a drugs effect. 
This is exemplified by the classic case of methotrexate re- 
sistance through the amplification of the gene DHFR in 
neoplastic tissue from an individual with disseminated 
small-cell lung cancer that relapsed during methotrexate 
chemotherapy [7]. Other forms of overexpression resistance 
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include the up-regulation of 1) efflux pumps (for example, 
ATP-binding cassette transporters), 2) survival mechanisms 
(for example, anti-apoptotic proteins), 3) DNA damage 
repair, 4) pathways for drug inactivation and 5) the overex- 
pression of target isotypes. Additionally, proteins down- 
stream of the inhibited target can be modulated in such a 
manner as to bypass the toxic effect of a drug. 

To dissect some of these mechanisms by which drugs 
act within the cell, we postulated that, when overexpressed, 
genes conferring resistance to a lethal chemical treatment 
can illuminate the drug mode of action. Overexpressing 
genes to confer resistance to an otherwise toxic compound 
is a well-established concept. Early studies showed that a 
streptomycin resistance gene cassette could function in a 
bacterial tetracycline resistant plasmid [8] and it was 
shown in other studies that cloning of the mouse dihydro- 
folate reductase into a bacterial plasmid provided resist- 
ance to trimethoprim in Escherichia coli [9] . A large-scale 
approach using the overexpression of a yeast genomic 
DNA library to identify genes conferring resistance to spe- 
cific drugs has validated the concept of gene dosage for the 
discovery of drug targets in eukaryotes [10]. 

A logical extension of this concept is to employ newly 
available biological tools that enable the systematic perturb- 
ation of all protein fianction in human cultured cells using 
RNA interference (RNAi), CRISPR (clustered regularly 
interspaced short palindromic repeats), cDNA libraries, 
transposons or small molecule inhibitors in combination 



o 



BioMed Central 



© 2014 Arnoldo et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under tlie terms of tlie Creative 
Commons Attribution License (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and 
reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain 
Dedication waiver (http://creativecommons.0rg/publicdomain/zero/l.O/) applies to the data made available in this article, 
unless otherwise stated. 



Arnoldo et at. Genome Medicine 2014, 6:32 
http://genonnennedicine.conn/content/6/4/32 



Page 2 of 16 



with gene overexpression. Although testing all important 
therapeutic drugs against all proteins from the approxi- 
mately 20,000 genes of the human genome in all differenti- 
ated cell types is not currently feasible, we report the initial 
development of a gain-of-function approach to identify 
drug mode of action by the overexpression of 12,200 hu- 
man ORFs (hORFs) [11,12] in the human cell line HEK293. 

In summary, we have developed a new experimental 
pipeline to identify genes whose up-regulation suppresses 
the toxic effect of chemicals in human cells and have suc- 
cessfully applied this strategy to seven pharmacological 
compounds. 

Methods 

Cell culture and plasmids 

Human embryonic kidney HEK293 (ATCC), HEK293T 
(ATCC), non-small cell lung carcinoma NCI-H1299 (ATCC), 
chondrosarcoma SW1353 (gift from Johanne Martel- 
Pelletier, University of Montreal, Canada) and pancreatic 
adenocarcinoma HPAC (ATCC) cells were maintained 
in DMEM (Wisent Inc. Montreal, Quebec, Canada) sup- 
plemented with 10% fetal bovine serum (Gibco, Carlsbad, 
California, US) at 37°C and 5% CO2. The rtTA express- 
ing HEK293_M2 (ATCC), human breast adenocarcin- 
oma MCF7_M2 (ATCC) and adenocarcinomic human 
alveolar basal epithelial A549_M2 (ATCC) cells were 
maintained in DMEM (Wisent Inc.) supplemented with 
10% fetal bovine serum (Gibco) at 37°C and 5% CO2. 
Blast phase chronic myelogenous leukemia K562 cells 
(gift from Reinhart Reithmeier, University of Toronto, 
Canada) were maintained in RPMI1640 (Sigma, St. Louis, 
Missouri, US) supplemented with 10% fetal bovine 
serum (Gibco) at 37°C and 5% CO2. Multiple myeloma 
U266B1 cells (gift from Aaron Schimmer, Princess 
Margaret Cancer Centre, Canada) were maintained in 
Iscove's modified Dulbecco's medium (IMDM; Gibco) sup- 
plemented with 10% fetal bovine serum (Gibco) at 37°C 
and 5% CO2. Thyroid gland medullary carcinoma TT cells 
(gift from Gilbert Cote, MD Anderson Cancer Center, 
USA) were maintained in Hams F-12 K (Gibco) supple- 
mented with 10% fetal bovine serum (Gibco) at 37°C and 
5% CO2. 

The lentiviral destination vector pLD-T-IRES-Venus- 
WPRE-STOP was cloned using the following procedure: 
the Gateway cassette from pLS-Dest-EcF (gift of Dr Tony 
Pawson, University of Toronto, Canada) was subcloned into 
pLJM17 (J Moffat, unpublished) using the restriction en- 
zymes Mlul and Xbal, resulting in the pLD-puro-TRE (T) 
vector. IRES-Venus-WPRE were PCR amplified from the 
pSLIK- Venus [13] using the following primers: XcmI_IR- 
ES_F (5 -GCGCCTTTTCCAAGGCAGCCCTGGAATTC 
CGCCCCTCTCCCTCC) and NsiI_WPRE_R (5'-AAACA 
ATGCATGTCGACGCGGGGAGGCGGCCCAAAGGGA 
GATCC). The IRES-Venus-WPRE amplicon was cloned 



into the pLD-puro-T vector using the restriction en- 
zymes Xcml and Nsil to replace the human phospho- 
glycerate kinase promoter and puromycin resistance 
gene. A stop codon was introduced directly downstream 
of the Gateway cassette by digesting pLD-T-IRES-Venus- 
WPRE with the restriction enzyme Xbal and blunted 
using DNA polymerase I, large (Klenow) fragment (New 
England Biolabs Inc., Beverly, Massachusetts, US). 

The piggyBac PB-TGcMV-Neo plasmid used for the hit 
confirmation was derived from the PB-TET (AddGene, 
Cambridge, Massachusetts, US). The 'IRES-beta-Geo' 
fragment from PB-TET was replaced by the 'promoter 
PGK-neomycin resistance gene' fragment by homologous 
recombination in Saccharomyces cerevisiae. Briefly, PB- 
TET was digested with RsrII and Apal to remove the 
beta-geo gene. The fragment coding for the promoter 
PGK and neomycin resistance gene, the URA3 cassette 
and the yeast 2 (i-origin of replication were PCR amplified 
and cloned into the digested PB-TET plasmid by homolo- 
gous recombination in yeast. Yeast was then transformed 
with the p414-Cre plasmid in order to excise the URA3 
cassette and the 2 (i-origin of replication after expression 
of the Cre recombinase. 

All short hairpin RNAs (shRNAs) against the RHOXF2 
and green fluorescent protein (GFP) mRNAs were de- 
rived from the RNAi Consortium (TRC) lentiviral librar- 
ies and obtained in the pLKO.l vector. Five shRNAs 
against RHOXF2 were tested and were designed against 
the following regions: 5'-CGGGATGAGAGATGATT 
ACTT (RHOXF2_shRNA_l); 5'-GACGAGAAAGAACT 
ACAGGAT (RHOXF2_shRNA_2); 5'-AGAAGCATGA 
ATGTGACTGAA (RHOXF2_shRNA_3); 5'-GAGGGC 
ATTAATGGCAAGAAA (RHOXF2_shRNA_4); 5'-GT 
CGCTTACTGAAGAGGTCAA (RHOXF2_shRNA_5). 

Library preparation, virus production and lentiviral 
infection 

The 12,212 hORF collection (representing 10,214 distinct 
genes) represents version 3.1 of an ongoing effort to create 
a complete human set of protein-encoding genes [11,12]. 
Initially based on the Mammalian Gene Collection cDNA 
collection, that work transfers full-length ORFs (excluding 
the 5 ' and 3 ' mRNA untranslated regions) into a Gateway 
system. Version 3.1 was obtained from Open Biosystems 
(Thermo Scientific, Waltham, Massachusetts, US) and re- 
groups non-fully sequenced verified and non-clonal ORFs 
(limitations corrected in the 8.1 version [14]). The col- 
lection was divided into 34 minipools of 376 hORFs; 
for each minipool, the hORFs were cloned en masse 
from the pDONR223 into the lentiviral expression vec- 
tor pLD-T-IRES-Venus-WPRE-STOP by Gateway LR 
reaction. After electroporation, transformants were se- 
lected on Luria Broth (LB) plus ampiciUin and the lenti- 
viral vector was extracted. 
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Lentivirus was produced by normalizing the amount of 
DNA for the 34 hORF minipools and by co-transfection 
with the packaging plasmid psPAX2 and the envelope 
plasmid pMD2.G into the packaging HEK293T cells using 
FuGENE (Roche Mississauga, Ontario, Canada). 

HEK293_M2 cells were then infected at a multiplicity of 
infection of 03. Doxycycline (2 (ig/ml) was added to in- 
duce the expression of the Venus fluorescent protein and 
Venus-positive cells were sorted using a BD FACS Aria 
Cell Sorter (East Ruherford, New Jersey, US). Because the 
expression of the gene coding for Venus is linked to the 
expression of the hORFs, selection of the fluorescent cells 
allowed the selection for non-silent, stably integrated 
lentivirus as well as functional inducible hORFs. 

Genomic DNA extraction, library preparation and data 
analysis for the hORFeome representation in 
HEK293_M2 cells 

We thawed 6.3 x 10^ HEK293_M2 cells harboring the 
virally integrated human ORFeome version 3.1 collection 
and these were cultured for 1 day to recover. Genomic 
DNA was recovered from 16 x 10^ cells and 6 (ig of gen- 
omic DNA (gDNA were used to amplif)^ the collection 
(three PCR reactions with 2 (ig of gDNA). Following 
PCR, the amplified ORFs were used directly to prepare 
a Nextera sequencing library (Illumina Nextera DNA 
Sample Preparation Kit, San Diego, California, US) and 
sequenced on an Illumina MiSeq according to the man- 
ufacturers protocol. Fifty- nucleotide paired-end reads 
were aligned against a reference database consisting of 
the ORE sequences in the hORFeome 3.1 collection 
using bwa (version 0.6.1). Coverage at each base position 
in each reference sequence was determined using the 
bedtools program genomeCoverageBed (v2.14.2). A cus- 
tom Perl script was used to summarize the coverage 
levels to determine the mean coverage for each reference 
sequence and the proportion of each reference sequence 
with different levels of coverage. Raw data have been de- 
posited in the ArrayExpress repository under accession 
number E-MTAB-2498. 

hORFeome drug target screen 

Aliquots of 500,000 HEK293_M2 cells were seeded in a 
T175 flask and grown for 15 hours before the induction of 
hORF expression by addition of doxycycline. After 18 hours 
of induction, cells formed micro-colonies that were cul- 
tured in the presence of drug or DMSO (as control) for 2 
to 3 weeks until distinct colonies were detectable. 

After each screen, surviving cells were harvested and 
gDNA was extracted as follows. Cells were lysed using 
SNET buffer (10 mM Tris pH 8.0, 0.1 M EDTA, 0.5% 
SDS, 0.1 mg/ml Proteinase K, 25 (ig/ml RNAse A). gDNA 
was isolated using one volume of phenol/chloroform/isoa- 
myl alcohol (25:24:1) pH 8.0, precipitated by adding two 



volumes of 95% ethanol (-20°C), washed with ice cold 70% 
ethanol and resuspended in Tris-HCl pH 8.0. 

During the screen selection experiments, only a minority 
of hORFs protected HEK293_M2s against the cytotoxic 
effect of the drug. Compared to the initial cell population, 
HEK293_M2 cells harboring the most resistant genes 
should be over-represented. In order to further amplify 
the set of resistant genes from the resulting reduced popu- 
lation, hORFs were PCR amplified from only 80 ng of gen- 
omic DNA. The sequences of the forward and reverse 
primers specific for the inserted hORFs were 5'-CGGT 
ACCCGGGGATCCTCTAGTCAGCTGAC and 5'-CCAT 
TTGTCTCGAGGTCGAGAATTCTAGCTAGAATC, re- 
spectively. Each reaction was carried out in a 50 \A volume 
containing 25 (il of 2x Phusion Flash High- Fidelity Master 
Mix (Finnzymes, Espoo, Finland), 200 nM of each primer 
and 80 ng of genomic DNA. The PCR profile was: 1 mi- 
nute at 98°C for one cycle; 10 s at 98°C, 20 s at 65°C, 4 mi- 
nutes at 72°C for 35 cycles; 10 minutes at 72°C for one 
cycle. PCR products were purified using QIAquick PCR 
Purification Kit (QIAGEN Venlo, Limburg, Netherlands), 
150 ng of purified PCR product was biotinylated using 
the BioPrime DNA Labeling Kit (Invitrogen, Carlsbad, 
California, US) and unincorporated biotin-14-dCTP was 
removed by passing the samples through Sephadex G-50 
columns (GE Healthcare Fairfield, Connecticut, US). 
Sample (150 ng) was added to the Affymetrix GeneChip 
Human Gene 1.0 ST array and hybridized at 45°C for 
17 hours with a rotation of 60 rpm. Chips were washed and 
stained with SAPE (2x MES staining buffer, 20 mg/mL bo- 
vine serum albumin (BSA) and 1 mg/ml streptavidin- 
phycoerythrin), washed on an Affymetrix fluidics station 
and scanned. 

Representation of the complete hORFeome collection 
was assessed using the same experimental conditions 
with some modifications. gDNA was extracted from at 
least 16 X 10^ cells (approximately 1,300 cells per hORF), 
3 PCR reactions were performed using 2 (ig of gDNA 
each (approximately 75 genomes per hORFs), two bio- 
tinylation reactions were performed using 500 ng of 
combined PCR products and 3.5 (ig of sample was added 
to an Affymetrix GeneChip Human Gene 1.0 ST array. 

Microarray raw data have been deposited in the 
ArrayExpress repository [15] under accession number E- 
MTAB-2493. 

Selection and individual validation of over-represented 
hORFs 

The hORFs conferring resistance to the drug were iden- 
tified in a step-wise procedure. Screen results were first 
visualized by plotting the log2 of the signal intensity for 
each hORF retrieved via PCR from cells cultured in the 
presence of drug (on the x-axis) and by plotting the log2 
ratio of the signal intensity for each hORF of the cells 
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cultured in the presence of drug divided by the signal in- 
tensity for each hORF of the cells grown in the presence 
of DMSO (on the y-axis). Genes with a log2(drug/DMSO) 
>3 and a log2(drug) >6 enrichment were then listed and 
ranked. Primary hits selected for validation were those 
previously enriched genes in common among the drug 
screen replicates (at least two independent experiments). 

For validation, 10,000 HEK293-M2 cells were seeded in 
300 (il of DMEM plus 10% iFBS plus penicillin/strepto- 
mycin in a 48-well plate pretreated with poly-L-lysine 
(Sigma, P4832). After 12 hours of incubation, gene expres- 
sion was induced by the addition of doxycycline (2 (ig/ml 
final concentration). After 18 hours of induction, drug 
was added and cell growth was assessed 3 days after drug 
addition by sulforhodamine B (SRB) assay as previously 
described [16]. 

Immunostaining 

The HEK293_M2 + RHOXF2 stable cell line (PB-TGcMV- 
Neo) were seeded in an eight-well chamber slide (BD 
Falcon, East Ruherford, New Jersey, US.) and cultivated 
overnight in the presence or absence of doxycyclin (2 (ig/ml 
final). Cells were treated with 50 nM of mitomycin C for 
2 days before fixation with 2% paraformaldehyde and 
permeabilization with 0.3% Triton X-100. Cells were 
blocked for 30 minutes with blocking/dilution buffer 
(10% goat serum, 0.5% NP-40, 0.5% saponin, Ix 
phosphate-buffered saline (PBS)) and incubated with the 
anti-phospho-histone H2A.X (serine 139, clone JBW301; 
Millipore, Billerica, Massachusetts, US) overnight at 4°C. 
Cells were then incubated with an Alexa 546 anti-mouse 
antibody for 1 hour at room temperature, incubated with 
DAPI (0.8 (ig/ml; Sigma) for 10 minutes and mounted 
using ProLong Gold (Invitrogen). Images were captured 
using a 40x dry objective. Nuclei and y-H2A.X foci were 
quantified using CellProfiler [17]. 

RNA extraction and library preparation for gene expression 
profiling of the RH0XF2 overexpressing cell line 

The HEK293_M2 + RHOXF2 stable cell line (PB-TGcMV- 
Neo) was seeded in 10-cm dishes, cultivated 24 hours in 
the presence or absence of doxycycline (2 (ig/ml final) be- 
fore being treated with 40 nM of mitomycin C or DMSO 
(control) during a 2-day period. Total RNA was extracted 
(RNeasy Mini Kit, QIAGEN), mRNA-focused libraries 
were generated from 1 (ig of total RNA (lUumina TruSeq 
RNA Sample Preparation Kit V.2) and sequenced using an 
Illumina HiSeq2000 according to the Illumina protocol. 
Raw data have been deposited in the ArrayExpress reposi- 
tory under accession number E-MTAB-2497. 

RNA-sequencing data analysis 

Single-end reads, 51 nucleotides in length, were gener- 
ated on the Illumina HiSeq. Sequence data were aligned 



to the UCSC hgl9 reference genome using TopHat 
(v2.0.0), provided with a RefSeq GTF file and instructed 
to align only across known junctions. Differential ex- 
pression, as fragments per kilobase of exon per million 
fragments mapped (FPKM) differences, were generated 
by analysis with cuffdiff (vl.1.0) using the indicated com- 
parisons. Based on their differential expression, potential 
activated and repressed isoforms were identified and re- 
lated enriched biological processes were assessed using 
GOrilla [18]. 

RH0XF2 quantitative RT-PCR 

Total RNA from 20 different normal human tissues 
(each pool from 3 donors) were obtained from the First- 
Choice Human Total RNA Survey Panel (Ambion, Austin, 
Texas, US). Total RNA (5 (ig) from each tissue was used 
for oligo(dT)12-18-primed reverse transcription using the 
Superscript II reverse transcriptase (Invitrogen). Quantita- 
tive PCR was performed using the LightCycler 480 SYBR 
Green I Master (Roche) and the LightCycler 480 System 
with the following primers: 

QhRHOXFl_lF (5'-ACCGTGTTCTACTGCCTGAGT 
GTA) and QhRHOXFl_lR (5 '-TTCATGCCGTTCTCGT 
GGTTCACA) on RHOXFl exon 1; QhRHOXFl_2F (5 '-T 
GGAGGAGCTGGAAAGTGTT) and QhRHOXFl_2.2R 
(5'-GGCCCTTTTATTCTTAAACC) spanning RHOXFl 
exon 1 and exon 3; QhRHOXF2_lF (5'-CCGGACCAGT 
GTAGCCAGTA) and QhRHOXF2_lR (5'-TCTTTTTCT 
TCTCCGCCTTG) spanning RHOXF2 exon 1 and exon 2; 
QhRHOXF2_2F (5 '-ATGGTGCTGTCGCTTACTGA) and 
QhRHOXF2_2R (5'-TCGAGGTCTCCTTCCCATAG) on 
RHOXF2 exon 2; QhRHOXF2_4F (5'-CAGCGGGATGAG 
AGATGATT) and QhRHOXF2_4R (5'-TTGGGGAATGT 
GAAAGAAGG) on RHOXF2 exon 3. 

Housekeeping/reference genes were: QhCYCG_F (5'-CT 
TGTCAATGGCCAACAGAGG) and QhCYCG_R (5'-GC 
CCATCTAAATGAGGAGTTGGT) on CYCG (cyclophilin 
G); QhGUSB_F (5'-ACGCAGAAAATATGTGGTTGGA) 
and QhGUSB_R (5'-GCACTCTCGTCGGTGACTGTT) 
on GUSB (beta-glucuronidase); QhActbFl (5'-GAAGTC 
CCTTGCCATCCTAAAAG) and QhActbRl (5'-AGGA 
CTGGGCCATTCTCCTTA) on ACTB (beta-actin); QhEe- 
flalF2 (5'-CTGAACCATCCAGGCCAAAT) and QhEefla 
1R2 (5'-AGCCGTGTGGCAATCCA) on EEFlAl (eukary- 
otic translation elongation factor 1 alpha 1); QhGAP 
DH_F1 (5'-CATGAGAAGTATGACAACAGCCT) and 
QhGAPDH_Rl (5 -AGTCCTTCCACGATACCAAAGT) 
on GAPDH (glyceraldehyde-3-phosphate dehydrogenase). 

RH0XF2 immunohistochemistry 

Formalin-fixed, paraffin-embedded sections of human 
normal testis were deparaffinized for 12 hours at 58°C 
and rehydrated. Heat-induced epitope retrieval was in 
sodium citrate buffer pH 6.0 using a pressure cooker 
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[19]. All antibodies were diluted in Tris-buffered Saline 
(TBS) with 1% BSA. Sections were blocked in 10% goat 
serum (Gibco) for 2 hours at room temperature. The 
anti-RHOXF2 primary antibody (Sigma, HPA003314) 
was used at a 1:150 dilution and incubated overnight at 4°C 
in a humid chamber. Endogenous peroxidase activity was 
suppressed by incubating the section in 0.3% H2O2 for 
15 minutes. Horse radish peroxidase (HRP) -conjugated sec- 
ondary antibody (anti-rabbit IgG-HRP; GE Healthcare) was 
used at a 1:250 dilution and incubated for 1 hour at room 
temperature. 3,3'-Diaminobenzidine (Abeam) was used as 
chromogen. Sections were counterstained with Mayers 
hematoxylin (Sigma) before dehydration and mounting. 

Western blotting 

For the detection of endogenous RHOXF2, total cell ly- 
sates were prepared from 200,000 cells. Briefly, samples 
were lysed in 2x SB loading buffer (60 mM Tris-HCl 
pH6.8, 3% SDS, 10% glycerol, 0.05% bromophenol blue, 
10% beta-mercaptoethanol) and subjected to SDS -PAGE 
(12%). The RHOXF2-specific mouse polyclonal antibody 
(Abeam, ab67811) and the tubulin specific rat antibody 
(Abeam, ab6160) were used at a dilution of 1:800 and 
1:40,000 respectively. Immunological complexes were vi- 
sualized by enhanced chemiluminescence using HRP- 
conjugated anti mouse or anti rat IgG. 

Drug dose-response curves 

K562 cells (100,000) were cultured in 600 \A of media in 
a 48-well plate for 24 hours before the addition of di- 
luted drug. Two days after drug treatment, the percent- 
age of drug inhibition was assessed by counting K562 
using a Coulter particle count and cell analyser and 
compared to cells grown in the presence of DMSO only. 

For the adherent HPAC, SW1353, and NCI-H1299 cell 
lines, 5,000 cells were seeded in 300 (il of media in 96- 
well plate and incubated for 24 hours prior to drug 
addition. Following two days of drug treatment, the per- 
centage of drug inhibition was estimated by SRB assay 
[16] and compared to cells grown in DMSO as control. 

Results 

Development of the assay: DHFR and methotrexate 

We developed an original phenotypic assay in which the 
mammalian cell line HEK293_M2 was used to identify 
hORFs capable of rescuing small molecule toxicity when 
overexpressed (Figure 1; Additional file 1). As a proof of 
concept, we rescued methotrexate toxicity by overex- 
pression of its target, the gene encoding the enzyme 
dihydrofolate reductase (DHFR). 

We first cloned the human DHFR gene into the Gateway- 
compatible lentiviral vector pLD-T-IRES-Venus-WPRE- 
STOP where gene expression is under the control of the 
Tet-On promotor (Additional file 2). After transduction 



of the rtTA-expressing cell line HEK293_M2 by lentivirus, 
we detected a high level of protein in the presence of 
doxycycline by western blotting (Figure 2a; Additional file 
3). No protein was detected in the absence of doxycycline, 
suggesting robust control of transgene expression. Cells 
were next exposed to increasing doses of methotrexate in 
the absence or presence of different doses of doxycycline. 
Increasing levels of DHFR overexpression in HEK293_M2 
cells rescued methotrexate toxicity in a dose-dependent 
manner, demonstrating that the lentiviral construct was 
functional (Figure 2b; Additional file 3). 

We implemented our approach on a larger scale by 
cloning, en masses a pool of 376 random hORFs (including 
DHFR) into the lentiviral vector. Corresponding lentivirus 
were used to generate a population of HEK293_M2 cells 
with the stably integrated hORFs. Gene expression was 
induced as before by addition of doxycycline and cells 
were exposed to a lethal concentration of methotrexate. 
Ten days after treatment, surviving cells were harvested, 
genomic DNA was extracted, and hORFs were PCR amp- 
lified and hybridized on an Affymetrix GeneChip Human 
Gene 1.0 ST array to determine which hORFs conferred 
resistance to methotrexate. DHFR was found as the pre- 
dominant gene in this pool of 376 hORFs whose overex- 
pression provided resistance to methotrexate, validating 
our strategy at the scale of several hundred pooled clones 
(Additional file 4). 

The same approach was next applied to the complete 
collection of 12,212 hORFs. Here, we divided the collec- 
tion into 34 minipools of 376 hORFs each, transferred 
them en masse into the pLD-T-IRES -Venus- WPRE-STOP 
lentivirus vector and then pooled 34 minipools together to 
produce sufficient quantities of lentivirus for the infection 
of HEK293_M2 cells at a multiplicity of infection of 0.3 
(Figure 1). Because induction of the Venus marker is 
dependent on hORF expression, addition of doxycycline 
for 24 hours generated sufficient fluorescence to sort 
those cells with stable lentivirus integration (Additional 
file 5). The presence and relative abundance of the virally 
integrated hORFs were assessed using next-generation se- 
quencing. Based on the identification of at least 95% of 
the sequence with a minimum of lOx coverage, 87% of 
the mappable hORFs were considered as present in the 
HEK293_M2 cells. Further analysis of the sequencing re- 
sults demonstrated a size-dependent relative abundance 
of the hORFs, with smaller clones being more represented 
in the population (Additional file 6). An aliquot of 500,000 
HEK293_M2 cells (an average of 41 cells per hORF) was 
seeded in a T175 flask, and grown for 15 hours before 
the induction of hORF expression by addition of doxy- 
cycline. After 18 hours of doxycycline induction, cells 
formed micro-colonies, which were cultured in the pres- 
ence of methotrexate for 2 weeks. hORFs derived from 
the surviving cells were identified by PCR amplification 
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Drug selection 



Figure 1 A gain-of-function cell-based assay to characterize the mode of action of small chemical compounds, (a) A collection of 12,212 
hORFs was cloned en mosse into the lentiviral expression vector pLD-IRES-Venus-WPRE-STOP. The resulting constructs were used to produce 
lentivirus and infect the rtTA-expressing cell line HEK293_M2 at a multiplicity of infection (MOI) of 0.3. Sorting of the Venus-positive cells resulted in the 
isolation of human cells with the functional integrated lentiviral constructs, (b) After seeding, doxycycline (dox) was added to the cells in order to in- 
duce the expression of the hORFs. After selection in the presence of a lethal dose of drug, surviving cells were harvested, gDNA extracted and the na- 
ture of the hORFs providing resistance to the chemical identified after hybridization on microarray. NGS, next-generation sequencing. 



and microarray hybridization. As we found for the mini- 
pool experiment, DHFR was the top gene conferring re- 
sistance to methotrexate when overexpressed (Figure 2c). 
These results encouraged us to test the same strategy for 
drugs with diverse modes of action. 



Screens with additional drugs 

We tested six additional diverse drugs, including the 
antifolate aminopterin, the lipase inhibitor orlistat, 
the Hsp90 inhibitor 17-(dimethylaminoethylamino)-17- 
demethoxygeldanamycin (a geldanamycin analog) and 



a c 



doxpg/ml 0 0.004 0.016 0.063 2 




0 A > > r 

15 60 240 960 

[methotrexate] nM 



Figure 2 DHFR was identified as the top candidate during a large-scale screen against methotrexate, (a) Immunoblotting sliowing tine 
conditional expression of DHFR in HEK293_M2 cells. Addition of 4 ng/ml, 16 ng/ml, 63 ng/ml and 2,000 ng/ml of doxycycline (dox) induced 
DHFR protein expression in the stable cell line HEK293_M2. Alpha tubulin was used as a loading control. For clarity, protein expression level at 
0.0032 ng/ml of doxycycline has been omitted, (b) Single DHFR overexpression rescued methotrexate toxicity in HEK293_M2 cells. The gene 
coding for DHFR was cloned into the lentiviral vector and used to create stable HEK293_M2 cells able to conditionally express DHFR. Dose- 
response curves of the cells grown in the presence of an increasing concentration of doxycycline were calculated after 2 days of exposure to 15, 
30, 60, 125, 250, 500 and 1,000 nM of methotrexate and compared with those for cells cultured in the presence of DMSO as control. Cell survival 
was quantified using a SRB assay. Errors bars represent the standard deviation for triplicate assays, (c) Identification of genes conferring resistance 
to methotrexate when overexpressed. HEK293_M2 cells harboring the 12,212 hORF collection were grown in the presence of a lethal dose of 
methotrexate. The nature of the hORFs conferring resistance to the drug was identified by plotting the log2 of the signal intensity for each hORF 
in the cells cultured in the presence of methotrexate on the x-axis; and by plotting the log2 ratio of the signal intensity for each hORF of the cells 
cultured in the presence of the drug divided by the signal intensity for each hORF of the cells grown in the presence of DMSO on the y-axis. 
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three DNA damaging agents (bleomycin, cisplatin and 
mitomycin C) based on the diversity of their mode of 
action (Table 1). 

Each drug was screened at least twice and the top hits 
(those hORFs most enriched in the final sample of 
cells; see Methods) were selected for validation. Of 123 
candidates, we successfully cloned 120 hORFs into 
the modified Gateway-compatible piggyBac vector PB- 
TGcMV-Neo (Additional file 7) and generated 120 inde- 
pendent HEK293_M2 stable cell lines. Protection against 
the drug toxicity was assessed two days after culturing 
cells in the presence of four concentrations of each drug 
with or without doxycycline induction. Relative cell dens- 
ities were compared with cells containing an empty vector 
control construct and were quantified by SRB assay. A 
total of 17 unique hits were confirmed to rescue their re- 
spective drug toxicity (Table 2). The number of validated 
hits varied considerably according to the compound, with 
bleomycin and 17-DMAG having low validation rates (7% 
and 3%, respectively) and, in contrast, cisplatin and mito- 
mycin C having high validation levels (27% and 42%, re- 
spectively) (Table 2). Several scenarios, both technical and 
biological, may explain this modest confirmation rate. 
First, PGR amplification of the hORFs from gDNA and 
probe cross-hybridization on microarrays can be expected 
to generate a certain percentage of false positives, despite 
each screen having been performed in duplicate. Second, 
since this version of the hORFeome library is not fully 



sequence verified, we speculate that some genes contain 
mutations that impair proper expression. We attempted to 
minimize the impact of mutations by restriction verifying 
four independent colonies from the original pDONR233 
bacterial stock, cloning positive clones as a pool for sub- 
sequent cloning into the PB-TGcMV-Neo vector and 
transfection into HEK293_M2 cells. Finally, the biological 
mechanisms by which these agents work will influence the 
number of hits obtained from each screen, and therefore 
affect the number of hits that can be confirmed. For this 
proof-of-principle study we chose to be conservative, 
selecting more hits for validation at the cost of a high 
false-positive rate. 

Screens for the antlfolates methotrexate and aminopterin 

For the antifolate screens, the single confirmed ORE was 
DHFR, This result highlights the fact that when a single 
hORF confers a strong growth advantage, it can domin- 
ate the other clones in the population and potentially 
mask the detection of other potential protective genes. 
One way to address this and potentially improve the dy- 
namic range for this drug class would be to repeat these 
screens with a pool of hORFs lacking DHFR, 

Among the potential expected hits present in our col- 
lection was the enzyme gamma- glutamyl hydrolase 
(GGH), which catalyzes the removal of polyglutamates 
from methotrexate. Therefore, overexpression of GGH 
would be expected to decrease the intracellular presence 



Table 1 Characteristics of screened compounds 



Drug name Screen concentration Category* 



Clinical use 



Known target(s) Reference 



Methotrexate 



Aminopterin 



Orlistat 



Bleomycin 



Cisplatin 
Mitomycin C 

17 DMAG 



60 nM 



20 nM 



20 mM 



1 \M 



2 mM 
80 nM 

70 nM 



Antimetabolites/ immunosuppressants Antineoplastic 

Antirheumatic 



Antimetabolites (discontinued) 



Antiobesity preparations, excluding 
diet products 



Cytotoxic antibiotics and related 
substances 



Other antineoplastic agents 

Cytotoxic antibiotics and related 
substances 

Experimental Hsp90 inhibitor 



Dermatologic agent 
Dermatologic agent (phase I) 
Antineoplastic agent (phase II) 
Anti-obesity agent 



Antineoplastic agent 



Antineoplastic agent 
Antineoplastic agent 

Antineoplastic agent (phases I and I 



DHFR 



DHFR, FPGS(?) 

LPL 

PNLIP 

FASN 

DNA 

LIG1 

LIG3 

DNA 

DNA 

Hsp90 

HSP90AA1 

HSP90AA2 

HSP90B1 

HSP90AB1 



[20] 

[21] 

[22] 
[23] 

[24] 



[25] 
[26] 

[27] 
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Table 2 Validated hits 



Drug 


Gene name 


Accession number 


Description 


Number of validated/tested 
candidate genes 


Methotrexate 


DHFR 


BC000192 


Diliydrofolate reductase 


1/3 


Aminopterin 


DHFR 


BC000192 


Diliydrofolate reductase 


1/9 


Bleomycin 


IMMT 


BC002412 


Inner membrane protein, mitochondrial (mitofilin) 


2/30 




STX3 


BC007405 


Syntaxin 3 




Cisplatin 


MAP2K1IP1 


BC026245 


Mitogen-activated protein kinase kinase 1 interacting protein 1 


4/15 




MMD 


BC026324 


Monocyte to macrophage differentiation-associated 






PTPN2 


BC016727 


Protein tyrosine phosphatase, non-receptor type 2 






RH0XF2 


BC021719 


Rhox homeobox family, member 2 




Mitomycin C 


C20orf54 


BC009750 


Chromosome 20 open reading frame 54 


8/19 




CCDC45 


BC009518 


Coiled-coil domain containing 45 






ELF5 


BC029743 


E74-like factor 5 (ets domain transcription factor) 






PTPN2 


BC008244 


Protein tyrosine phosphatase, non-receptor type 2 






RH0XF2 


BC021719 


Rhox homeobox family, member 2 






TMEM150 


BC050466 


Transmembrane protein 150 






USPLl 


BC038103 


Ubiquitin specific peptidase like 1 






ZFP64 


BC012759 


Zinc finger protein 64 homolog (mouse) 




1 7-DMAG 


EIF4B 


BC073139 


Eukaryotic translation initiation factor 4B 


1/29 


Orlistat 


MGLL 


BC006230 


Monoglyceride lipase 


2/18 




PONS 


BC070374 


Paraoxonase 3 





of methotrexate-polyglutamate, thereby increasing the 
efflux of methotrexate out of the cell and decreasing 
cytotoxicity. However, GGH overexpression was demon- 
strated to be insufficient to produce resistance to the 
drug in a human fibrosarcoma (HT-1080) and a human 
breast carcinoma (MCF-7) cell line [28]. 

DHFRLl (dihydrofolate reductase like-1) consistently 
appeared as a top hit for screens with both methotrexate 
and aminopterin but failed to confer resistance when in- 
dividually expressed in HEK293_M2 cells. Based on the 
strong similarity between DHFR and DHFRLl (a total of 
14 non-synonymous changes), we hypothesized that the 
observed unspecific signal was probably due to cross - 
hybridization. Interestingly, a similar problem occurred 
when the same strategy using the same hORFeome li- 
brary was tested in the model organism S. cerevisiae. 
Both DHFR and DHFRLl were enriched in the screens 
against methotrexate (and their expression confirmed at 
the protein level) but only DHFR conferred methotrex- 
ate resistance (data not shown). 

Orlistat screens 

Orlistat is a lipase inhibitor that reportedly targets LPL 
(lipoprotein lipase), PNLIP (pancreatic lipase) and FASN 
(fatty acid synthase). Although LPL is absent from our col- 
lection, PNLIP and FASN are present. However, genes for 
the latter two did not confer overexpression resistance. 



Our screen identified the gene coding for monoglycer- 
ide lipase (MGLL) as a resistance hit. MGLL hydrolyzes 
intracellular triglyceride stores in adipocytes and other 
cells to fatty acids and glycerol. The enzyme might also 
be involved in the hydrolysis of monoglycerides [29]. In 
vitro experiments suggest that orlistat can inhibit the 
MGLL-like activity in rat-cerebellar membranes and rat 
cerebellar homogenate [30], rendering the human MGLL 
a potential direct target for orlistat. 

Another validated hit, PONS, is also involved in lipid 
metabolism. The gene encodes paraoxonase 3, which 
is secreted into the bloodstream and associates with 
high-density lipoprotein. The protein can also rapidly 
hydrolyze lactones and inhibit the oxidation of low- 
density lipoprotein. 

Bleomycin screens 

Although the exact mechanism of action of bleomycin is 
unknown, available evidence indicates that its main mode 
of action is via the inhibition of DNA synthesis, with add- 
itional evidence for its inhibition of RNA and, to a lesser 
extent, protein synthesis [31]. 

We show that overexpression of IMMT (inner mem- 
brane protein, mitochondrial) partially rescues bleomycin 
toxicity. Bleomycin treatment induces damage of both nu- 
clear and mitochondrial DNA [32]. A recent study demon- 
strated that the mitochondrial localization of PARP-1 
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requires interaction with IMMT and that it is involved in 
the maintenance of mitochondrial DNA integrity [33]. The 
authors also suggest that IMMT overexpression results in 
an increase of PARP-1 in the extracellular compartment. 
We therefore speculate that IMMT overexpression recruits 
sufficient PARP-1 into the mitochondrion to maintain 
mitochondrial DNA integrity and promote cell survival. 

Cisplatin and mitomycin C screens 

RHOXF2 and PTPN2 were found to suppress the tox- 
icity of cisplatin and mitomycin C. Together with the 
antifolates, these DNA damaging agents were the only 
screens yielding identical hits. Also, as for methotrexate 
and aminopterin, both these alkylating agents have the 
same mechanism of action. 

Another validated hit for mitomycin C, ZFP64 (zinc 
finger protein 64 homolog), is noteworthy because it has 
been shown to be potentially phosphorylated by ATM 
and ATR, the major signal transducing kinases of the 
DNA damage response in response to ionizing radiation 
[34]. This observation is consistent with a potential role 
for ZFP64 in the repair of DNA damage generated by 
both alkylating agents and ionizing radiation. 

RH0XF2 overexpression confers resistance to several DNA 
damaging agents 

Among the top candidates from the screens and confirma- 
tions, we focused on RHOXF2, a relatively uncharacterized 
member of the homeobox gene family, because this gene, 
when overexpressed, provided resistance to both cisplatin 
and mitomycin C. RHOXF2 clearly reduced drug-induced 
toxicities when overexpressed in the HEK293_M2 cell line 
treated with cisplatin (1 to 2.5 (iM) or with mitomycin C 
(20 to 80 nM). Rescue efficiency ranged from 45 to 60% 
and 30 to 42%, respectively (Figure 3a). This observation 
was confirmed using an independent cell biological assay 
in which overexpression of this transcription factor signifi- 
cantly reduced the formation of y-H2A.X foci in cells 
treated with 50 nM of mitomycin C for 2 days (Figure 3b). 

We next asked if RHOXF2's protective effect was re- 
stricted to cisplatin and mitomycin C or if this gene, when 
overexpressed, could also suppress the toxicity of DNA 
damaging agents with different mechanisms of action. Ac- 
cordingly, we assessed the cytotoxicity of antimetabolites 
(methotrexate, 5-fluorouracil), a radiomimetic (bleomycin), 
replication inhibitors (camptothecin, doxorubicin, etopo- 
side), mitaplatin, 4-nitroquinoline-l -oxide and several ex- 
perimental platinum anticancer agents (PT-ACRAMTU 
(EN), PT-ACRAMTU(PN), PT-ATUCA, PT-AMIDIN) on 
HEK293-M2 cells expressing or not the transcription factor 
(Figure 3c; Additional file 8). With the exception of the 
topoisomerase I inhibitor camptothecin, all drugs whose 
toxicity was suppressed by RHOXF2 overexpression dir- 
ectly bind to DNA to exert their effects. 



These observations motivated us to ask (i) if and how 
RHOXF2, an uncharacterized transcription factor, mod- 
ulates gene expression in HEK293_M2 cells and (ii) if 
the nature of the regulated genes can illuminate the pro- 
tective response to DNA damaging agents. We cultured 
HEK293_M2 cells with or without overexpression of 
RHOXF2 in the presence of 40 nM of mitomycin C 
or DMSO (control) and profiled their transcriptional 
response by RNA-seq. The sequencing data confirmed 
the RHOXF2 transcript was absent in the uninduced 
HEK293_M2 samples and verified its strong induction in 
the presence of doxycycline regardless of the drug treat- 
ment (Additional file 9). Neither RHOXFl nor RHOXF2B 
expression was affected by RHOXF2 overexpression 
(Additional file 9). Interestingly, and despite their limited 
number. Gene Ontology analysis of the genes whose 
expression was induced during RHOXF2 overexpres- 
sion showed that the most significantly enriched path- 
ways are related to DNA damage processes {P<7 x 10'^ 
and P<3x 10'^), stress response (P < 3 x 10"^), apop- 
tosis (P < 4 X 10'^) and cell cycle {P<7x 10'^) (Figure 3d; 
Additional file 9). As anticipated, in the absence of 
RHOXF2 and the presence of mitomycin C, we found a 
strong enrichment for genes related to the response to 
stress (P < 3 X 10'^) and regulation of protein metabolic 
process {P < 9.5 x 10"^). Finally, in the presence of RHOXF2, 
genes involved in the regulation of signal transduction 
(P < 4 X 10'^) were induced during drug treatment. There- 
fore, we suggest that when treated with DNA damaging 
agents, the protective effect conferred by overexpression 
of RHOXF2 is due, in part, to its activation of several key 
genes involved in the response to DNA damage. 

Finally, we asked if the RHOXF2-dependent protective 
effect observed in HEK293_M2 cells was restricted to 
one cell type. We then tested if RHOXF2 overexpression 
could confer resistance to the rtTA-expressing human 
breast adenocarcinoma cell line MCF7_M2 and the 
adenocarcinomic human alveolar basal epithelial cell 
line A549_M2. As quantified by both a two-fold shift 
in IC50 (44.91 \iM to 88.17 \iM) and by the 21% decrease 
in toxicity at 3 (iM, RHOXF2 overexpression protected 
MCF7_M2 cells against cisplatin- mediated toxicity 
(Additional file 10a). In the absence of RHOXF2, addi- 
tion of doxycycline resulted in a modest increase in IC50 
(41.06 [iM to 58.59 [iM; empty vector). The effect of 
the vector alone, while detectable, is less than that ob- 
served in the presence of RHOXF2. In contrast, although 
RHOXF2 overexpression in A549_M2 cells provided 
some apparent resistance (shift in IC50 from 448 nM to 
1,099 nM), the measured effect was, in fact, largely due 
to doxycycline (shift in IC50 from 718 nM to 1,454 nM 
with empty vector) (Additional file 10b). In conclusion, 
our experiments suggest that the protective effect of 
RHOXF2 is not confined to a single genetic background 
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2 2.5 cisplatin (|jM) 

51.25 45.84 reSCUG (%) 




^ Empty vector, nodox 
^ Empty vector, with dox 
I RH0XF2 vector, no dox 
I RH0XF2 vector, with dox 



80 mitomycin C (nIVI) 

29.95 rescue (%) 




dox vs no dox (no drug) 



-dox 

50nM mitomycin C 



+dox 
SOnlVI mitomycin C 



signal transduction in response to DNA damage 
response to hiyperoxia 
DNA damage checkpoint 
stress-induced premature senescence 
inflammatory cell apoptotic process 
regulation of interptiase of mitotic cell cycle 
intracellular receptor mediated signaling pathway 



mitomycin C vs no drug (no dox) 




Drug 


Type* 


Mode of action 


RH0XF2 

overexpression rescue 


Cisplatin 


other antineoplastic agents 


DNA cross linking, alkylation 


Rescue 


MitomycinC 


Cytotoxic antibiotics and related substances 


DNA cross linking, alkylating like 


Rescue 


IVIitaplatin 


None 


Bind to nuclear DNA and mitochondrial pyruvate dehydrogenase 


Rescue 


PT-ACRAMTU(EN) 


None 


Bind DNA 


Rescue 


PT-ACRAIVITU(PN) 


None 


Bind DNA 


Rescue 


PT-ATUCA 


None 


Bind DNA 


Rescue 


Camptothecin 


None 


Topoisomerase 1 inhibitor 


Rescue 


IVIetliotrexate 


Antimetabolites, Immunosuppressants 


Inhibit the metabolism of folic acid 


No rescue 


5-fluorouracil 


Antimetabolites 


Pyrimidine analog, irreversible inhibition of thymidylate synthase 


No rescue 


Bleomycin 


Cytotoxic antibiotics and related substances 


Intercalation or interaction with DNA minor groove 


No rescue 


Doxorubicin 


Cytotoxic antibiotics and related substances 


Intercalating DNA, inhibition of topoisomerase II progression 


No rescue 


Etoposide 


Plant alkaloids and other natural products 


Topoisomerase II inhibitor 


No rescue 


Hydroxyurea 


Other antineoplastic agents 


Ribonucleotidereductase inhibition 


No rescue 


4 - nitroquinoline -1 -oxide 


None 


DNA adduct forming agent 


No rescue 


PT-AMIDIN 


None 


Bind DNA 


No rescue 



response to stress 
regulation of protein metabolic process 
positive regulation of innate immune response 
heart development 
response to DNA damage stimulus 
intrinsic apoptotic signaling pathway 
toll-like receptor signaling pathway 
negative regulation of cell cycle 
positive regulation of histone H3-K4 methylation 
regulation of protein kinase activity 
positive regulation of gene expression 



Anatomical Therapeutic Chemical Classification System 



mitomycin C vs no drug (with dox) 



regulation of signal transduction 
regulation of execution phase of apoptosis 
covalent chromatin modification 
histone acetylation 
regulation of cell communication 



Figure 3 RH0XF2 overexpression in HEK293_IV12 cells conferred resistance to a wide variety of DNA damaging agents, (a) In HEK293_M2 
cells, RH0XF2 overexpression rescued cisplatin and mitomycin C toxicity. Stable RH0XF2-expressing cells were cultured in the presence of an 
increasing concentration of cisplatin or mitomycin C and their growth was compared to stable cells with the empty vector PB-TGcMV-Neo. 
The effect of RH0XF2 expression on cell viability was measured two days after drug exposure and compared to cells cultured in the absence 
of drug as a 100% viability control. P < 0.05, TukeyHSD test: ""significant difference between RH0XF2 vector with and without doxycycline 
(dox); """"significant difference between RH0XF2 vector with doxycycline and empty vector with doxycycline. Error bars represent standard error 
of the mean (n = 4). (b) Reduction in the number of Y-H2A.X foci by RH0XF2 overexpression in HEK293_M2 cells. Cells expressing or not 
RH0XF2 were treated with 50 nM of mitomycin C (approximately ICio) for 2 days and stained for the presence of Y-H2A.X foci. Following 
RH0XF2 overexpression, the number of Y-H2A.X foci per nucleus dropped from 20.71 ± 1.49 to 15.73 ±0.92 (24% rescue). Nuclear DNA was 
stained with DAPI. Quantification of Y-H2A.X foci in >250 cells from triplicate experiments. Scale bar: 10 Mm. (c) In HEK293_M2 cells, RH0XF2 
overexpression conferred resistance to various DNA damaging agents. RH0XF2 drug suppression capacity was tested using DNA damaging 
agents with various modes of action as previously described for cisplatin and mitomycin C. (d) Gene Ontology analysis of upregulated genes 
in HEK293_M2 cells. Cells expressing or not RH0XF2 were treated with 40 nM of mitomycin C for 2 days. Based on their P-values, the most 
enriched biological processes are shown. 



and expression of the transcription factor is sufficient to 
protect both HEK293 and MCF7 cells against the DNA 
damaging agent cisplatin. As with all large-scale screens, 
the biological significance of this observation should be 
further confirmed, for example, by interrogating a larger 
panel of cell lines. 

Based on its viability and transcriptional phenotypes in 
HEK293_M2 cells, we further investigated the potential 
role for RHOXF2 in neoplastic cells. 



In normal tissue, RH0XF2 expression is confined to testis 
but RH0XF2 is expressed in several cancer cell lines 

In a small-scale study performed by northern blot analysis 
on normal tissues, expression of the transcription factor 
RHOXF2 was initially reported to be restricted to testis 
[35]. We performed a large-scale systematic survey interro- 
gating a more representative panel of 20 different normal 
human tissues (3 donors per tissue type) by quantitative 
RT-PCR (Figure 4a). We first confirmed RHOXF2's testis- 
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Figure 4 RH0XF2 is expressed in various cancer cell lines and modulates K562 resistance to DNA damaging agents, (a) In normal 
human tissues, RH0XF2 was exclusively detected in testis. Total RNA from 20 distinct tissues (mix of 3 different donors) was reverse transcribed 
into first strand cDNA and used as template for quantitative PGR. Three distinct pairs of primers covering exons 1, 2 and 4 were used. RH0XF2 
expression level was calculated relative to the signal obtained for the amplification of housekeeping genes (CYCG, GUSB, ACTB, EEFlAl, GAPDH). 
(b) Representative example of immunohistochemical staining for RH0XF2 in normal testis. Section has been counterstained with Mayer's 
hematoxylin, (c) RH0XF2 was detected in various tumor cell lines by western blotting, (d) RH0XF2 shRNA validation in the human chronic 
myelogenous leukemia cell line K562. WT, wild type. {e,f) Partial depletion of RH0XF2 increased K562 sensitivity to cisplatin and mitomycin C. 
Final numbers of viable cells were calculated in the absence of drug (inset). Student's t-test; *P<0.05; **P<0.01; means ± standard error of the 
mean for n = 4 experiments. 



specificity and then investigated which specific testis cell 
types expressed RHOXF2 protein by immunohistochemis- 
try on normal testis samples. We detected a strong signal 
for RHOXF2 in spermatogonia and primary spermatocytes 
but not in spermatids (Figure 4b). Because RHOXF2 gene 
expression was detected in early stage spermatogenic cells 



and not in later stages, we hypothesized that RHOXF2 
might be involved in the self-renewal or maintenance of 
the undifferentiated state of the testis germ cells. 

Computational analysis predicts RHOXF2 to be a tes- 
ticular cancer candidate gene [36] and, in fact, the tran- 
script was detected in several types of human testicular 
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cancers [37] . Furthermore, in the present study, we show 
that RHOXF2 overexpression conferred resistance to cis- 
platin (Figure 3a) and that RHOXF2 is found in several 
other cancer cell lines, the non-small cell lung carcin- 
oma NCI-H1299, the blast phase chronic myelogenous 
leukemia K562, the multiple myeloma U266B1, the thy- 
roid gland medullary carcinoma TT (CRL-1803) and the 
chondrosarcoma SW1353 (Figure 4c; Additional file 11). 

In the chronic myelogenous leukemia K562 cell line, 
RH0XF2 depletion modulates the response to 
antineoplastic agents 

Our study showed that RHOXF2 overexpression confers 
resistance to cisplatin and mitomycin C in HEK293_M2 
cells, in which the protein is not endogenously expressed 
(Figure 4c; Additional file 9). We first generalized this 
observation by positing that overexpression or ectopic ex- 
pression of certain genes greatly reduces some detrimental 
drug effects on the growth of particular cancer cell lines. 
We then reasoned that if this is truly drug-gene specific, 
the opposite mechanism should experimentally verify 
it. More specifically, depletion of the endogenously 
expressed protein should confer sensitivity to the cell line 
during drug treatment. The most direct translation of this 
gene-drug specificity could then be exploited to under- 
stand drug resistance in cancer cell lines. Therefore, we 
asked if sensitivity to DNA damaging agents was observed 
when RHOXF2 is knocked down in cancer cells where 
the protein is normally expressed. We generated several 
stable cell lines to deplete RHOXF2 (NCI-H1299, K562, 
SW1353, and the human pancreatic HPAC as negative 
control) with five shRNAs that target the mRNA. Two 
shRNAs against GFP were used as a negative control. Two 
shRNAs showed a strong knock-down of the protein in all 
three RHOXF2-positive cell lines (Figure 4d; Additional 
file 12e,h). 

K562 showed a significant increase in sensitivity for 
both cisplatin (IC50: 3.231 (iM to 2.052 \iM) and mitomy- 
cin C (IC50: 377.8 nM to 250.4 nM) when the transcrip- 
tion factor was depleted, demonstrating the potential 
role of RHOXF2 in response to DNA damaging agents 
and particularly in response to DNA crosslinking agents 
(Figure 4e,f). Independent K562 transduction resulted in 
identical sensitivity to the drugs, showing the phenotype is 
independent of the lentiviral integration site. We ruled out 
the possibility that the calculated drug sensitivity simply 
reflected a growth difference among cell lines by demon- 
strating that the RHOXF2_shRNA and GFP_shRNA con- 
trol cell lines divided at the same rate (Figure 4e,f, insets). 
Finally, we assessed the robustness of our knock-downs in 
the presence of the drugs by confirming that the modula- 
tion of RHOXF2 protein level is maintained even when 
the K562 cells were exposed to cisplatin or mitomycin C 
at IC70 (Additional file 13a,b). 



As expected, no change was measured for the RHOXF2- 
negative HPAC cells (Additional file 12a,b). Interestingly, 
no increase in sensitivity to cisplatin and mitomycin C was 
observed for NCI-H1299 and SW1353 knock-down cell 
lines (Additional file 12c,d,f,g). 

The modest increase in sensitivity of K562 cells to DNA 
damaging agents can be explained by different factors. 
First, transcription factors are generally present in limited 
quantities in cells [38-40]. Here, although clearly depleted 
in the presence of the shRNAs, RHOXF2 is still present 
at a relatively high level in K562 cells (Figure 4d). There- 
fore, we speculate that despite the efficient knock-down, 
the limited shift in IC50 is in part due to the presence of 
enough protein to sustain a certain level of resistance dur- 
ing drug treatment. In order to test our first hypothesis, 
a complete knock-out of the transcription factor would 
be necessary (for example, using CRISPR technology). 
Secondly, a recent study in mouse embryonic stem cells 
reports that knock-down of a few transcription factors 
is associated with substantial transcriptome change [41]. 
These data not only indicate the robustness of the pluripo- 
tency gene network but also suggest that perhaps RHOXF2 
knock-down hardly perturbs the transcription factor net- 
work in the K562 cellular context. 

Discussion 

We have reduced to practice an elegant method to simul- 
taneously evaluate thousands of genes for their capacity to 
provide a survival advantage in the presence of a toxic 
dose of drug using cells expressing the entire hORF collec- 
tion and a microarray or sequencing-based readout. 

The availability of hORF collections represents great 
progress compared to traditional approaches (random 
mutagenesis, cDNA libraries, and so on) because it per- 
mits the systematic interrogation of a gene's biological 
function in a targeted manner. Such gain-of-function ap- 
proaches would be bolstered when combined with the 
reciprocal loss-of- function approach [42]. To date, sev- 
eral studies have already explored the availability of such 
collections. Most of the studies tested the hORFs func- 
tion in an array (well-based) format [43-45] that simpli- 
fies the readout and analysis of the results. Here, we 
tested the function of thousands of ORFs simultaneously 
in a competitive manner. Our pool approach greatly re- 
duced the tedious and expensive upstream steps of clon- 
ing, transduction and testing of the genes. The primary 
challenges to the pool approach are (i) potential loss of 
ORFs during cloning and infection (leading to represen- 
tation variability), (ii) the potential masking effect of 
overrepresented clones in a population during selection 
(for example, DHFR in the case of methotrexate) and 
(iii) the necessity for dedicated analysis of the final results 
(hits prioritization). In addition to future enhancements of 
the human ORFeome collection, other promising gain-of- 
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function approaches to assess drug mode of action are 
the development of functional variomics technology 
[46] and the exploration of human isoform space in hu- 
man cell lines. 

In developing the assay, we first determined the opti- 
mal period of time between the beginning of the gene 
expression and the addition of the chemical compound. 
To address that question, we quantified the number of 
Venus -positive cells 12 hours, 24 hours and 48 hours 
after induction of expression of the hORF collection. Be- 
cause the human gene and the Venus marker are co- 
transcribed, quantifying the level of Venus expression by 
flow cytometry serves as a proxy for the level of human 
gene expression. Because the number of Venus -positive 
cells peaks at 24 hours after doxycycline addition 
(Additional file 5), we postulated that 18 hours post- 
induction would be sufficient time after which to add 
drug to test the human gene for any protective effects. 

We then addressed the sensitivity of the technique by 
improving the ratio between the number of seeded cells, 
the growth area and the time of drug addition. Seeding 
500,000 cells in a T175 flask 33 hours before drug 
addition provided enough time and density for isolated 
cells to form separate micro-colonies that should over- 
express only one or a few genes. This 'clonogenic assay 
approach' was more sensitive to lower drug concentra- 
tions and provided a better selective environment for 
the surviving cells, presumably by minimizing signaling 
from a large population of dying cells that would arise at 
higher plating densities. Similarly, adding fresh medium 
and drug every 48 hours for the duration of the screen 
improved the selection process by washing away the 
dead and dying adherent cells that could have interfered 
with the subsequent screen readout. 

During the course of the screen, the cell population 
followed a stereotypical pattern of growth, starting with 
a dramatic decrease in cell number and disappearance of 
the majority of cells, followed by a stabilization of the 
population and concluding with the development of 
medium to large size colonies originating from the initial 
surviving cells. Daily visual inspection of the cell popula- 
tion was crucial to determine when to stop the drug se- 
lection and harvest the cells, typically after 20 to 30 
doubling times. 

Several refinements may enhance this assay. First, im- 
proving the coverage and expression quality of the hu- 
man genes using a more complete collection of fully 
sequenced ORFs as such collections become available 
[14]. Moreover, as illustrated in Additional file 6, relative 
abundance of the virally integrated hORFs is size- 
dependent. Therefore, to correct for that variation, prep- 
aration of the normalized amount of plasmid DNA for 
lentivirus production could be done as follows: i) reduce 
the pool size (less than 376 hORFs per pool as in our 



study); ii) hORFs should be grouped by gene length. One 
can also match the particular drug under interrogation 
to disease-appropriate cell types and environmental 
conditions. Whenever possible, experiments should be 
performed with an initial high number of cells to guar- 
antee the full representation of the collection during 
drug target screening. Although DNA damaging resist- 
ance was observed in the RHOXF2-overexpressing 
HEK293_M2 cells, it would be informative to further 
study if that observation is dependent on RHOXF2 ex- 
pression status. Finally, next-generation sequencing will 
facilitate the throughput of hit detection and reduce 
the number of false positives arising from microarray 
cross-hybridization. 

Our study revealed several genes whose functions re- 
late to the action of the drugs tested. RHOXF2 is of a 
particular interest because of (i) its strong rescue pheno- 
type in HEK293_M2 cells, (ii) its capacity to suppress 
both cisplatin and mitomycin C toxicity, and (iii) the 
dearth of functional information on this transcription 
factor. Homeobox genes encode transcription factors 
that play a central role during embryogenesis. RH0XF2 
belongs to the Rhox family of genes, which are expressed 
not only during embryogenesis but also in adult repro- 
ductive tissue [47]. In normal tissue, RHOXF2 expres- 
sion is likely restricted to testis but its function remains 
largely unknown [35,48]. Here, we demonstrated that (i) 
the presence of RHOXF2 was sufficient to lower both 
cisplatin and mitomycin C toxicity in HEK293_M2 cells, 
and (ii) the transcription factor might exert its effect via 
the activation of genes involved in response to DNA 
damage. Furthermore, RHOXF2 protein expression was 
detected in several cancer cell lines. In one of these lines 
(chronic myelogenous leukemia/K562) knocking down 
the protein increased the toxicity for both DNA dam- 
aging agents. Interestingly, although no growth disad- 
vantage was observed in K562 cells partially depleted 
for RHOXF2 (Figure 4e,f), its depletion via shRNA in- 
creased the sensitivity of this leukemia cell line to several 
pharmacological compounds. We therefore speculate 
that although not a driver of tumorigenesis, RHOXF2 
could confer on the neoplastic cells a selective advantage 
during drug treatment. 

Conclusions 

We demonstrate the usefulness of our approach in order 
to understand drug mechanisms of action by combining 
genetic perturbation with drug treatment. Here we reca- 
pitulated examples of drug resistance and uncovered 
several unanticipated drug-target interactions by virtue 
of the suppression of chemical toxicity observed upon 
gene overexpression. We also provide a step-by-step 
protocol for performing such screens and for analyzing 
the data (Additional file 1). Moreover, our innovative 
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overexpression approach could also be applied to identify 
genes whose up-regulation could be toxic or enhance cell 
proliferation in different cell types. For example, the assay 
could be useful to uncover tumor-specific activity in the 
context of genome engineering as well as for the discovery 
of cell-type-specific oncogenes. 

Additional files 



Additional file 1: Step-by-step protocol for a genome-wide 
mammalian overexpression drug screen. 

Additional file 2: Plasmid map of the lentiviral vector pLD-T-IRES- 
Venus-WPRE-stop. 

Additional file 3: Original immunoblotting showing the conditional 
expression of DHFR in HEK293_IVI2 cells. Addition of 4 ng/ml, 16 ng/ml, 
32 ng/ml, 63 ng/ml and 2,000 ng/ml of doxycycline induced DHFR protein 
expression in the stable cell line HEK293_I\/12. Expression level was compared 
to alpha tubulin as a loading control. The asterisk marks a human protein 
that cross-reacts with the anti-DHFR antibody. 

Additional file 4: DHFR was identified as the predominant gene in 
this pool of 376 hORFs whose overexpression provided resistance 
to methotrexate. HEK293_M2 cells harboring a minipool collection of 
376 hORFs (including DHFR) were grown in the presence of a lethal dose 
of methotrexate. The nature of the hORFs conferring resistance to the 
drug was identified by plotting the log2 of the signal intensity for each 
hORF in the cells cultured in the presence of methotrexate on the x-axis; 
and by plotting the log2 ratio of the signal intensity for each hORF of the 
cells cultured in presence of the drug divided by the signal intensity for 
each hORF of the cells grown in presence of DMSO on the y-axis. 

Additional file 5: Detection of Venus-expressing HEK293_IVI2 cells 
after 12, 24 and 48 hours of induction. (a,b) HEK293_M2 cells stably 
transduced with the hORFeome collection (b) or not (a) were cultured in 
the presence of doxycycline (2 pg/ml) for 12, 24 and 48 hours. After gene 
induction, cell number and intensity of the Venus fluorescence were 
measured by flow cytometry. The percentage of Venus positive cells was 
compared to the total number of living cells measured. 

Additional file 6: Human ORFeome relative abundance in 
HEK293_IVI2s at TO. 

Additional file 7: Plasmid map of the transposon-based vector 
PB-TGcMV-Neo. 

Additional file 8: In HEK293_M2 cells, RH0XF2 overexpression 
conferred resistance to various DNA damaging agents. Stable RH0XF2 
cells was cultured in the presence of an increasing concentration of DNA 
damaging agents and their growth was compared to the stable cells with 
the empty vector PB-TGcMV-Neo. The effect of RH0XF2 expression on 
cell viability was measured two days after drug exposure and compared to 
cells cultured in the absence of drug as a 1 00% viability control. TukeyHSD 
test, P< 0.05; "significant difference between RH0XF2 vector with and 
without doxycycline; ""significant difference between RH0XF2 vector with 
doxycycline and empty vector with doxycycline. Error bars represent 
standard error of the mean (n = 4). 

Additional file 9: RH0XF2 raw FPKM values and Gene Ontology 
enrichment. 

Additional file 10: RH0XF2 overexpression rescued cisplatin toxicity 
in HEK293_I\/I2 and l\/ICF7_IVI2 but not A549_IVI2 cells. Stable RH0XF2 
cells were cultured in the presence of an increasing concentration of 
cisplatin and their growth was compared to the stable cells with the empty 
vector PB-TGcMV-Neo. The effect of RH0XF2 expression on cell viability 
was measured three days after drug exposure and compared to cell 
cultured in the absence of drug as a 100% viability control. Lines show the 
nonlinear fit of a variable-slope dose-response model for (a) MCF7_M2 cells, 
(b) A549_M2 cells and (c) HEK293_M2 cells as positive control; (d) resulting 
IC50S. Data analysis was performed using the 'drc' package in R. Experiments 
were done in triplicate. 



Additional file 11: RH0XF2 was detected in various tumor cell lines 
by western blotting. 

Additional file 12: RH0XF2 knocked-down in HPAC, NCI-HI 299 and 
SW1353 cells does not modulate cisplatin and mitomycin C 
sensitivities, (a-g) Dose-response curves for HPAC (a,b), NCI-H1299 (c,d) 
and SW1353 (f,g) were established after 2 days of exposure to cisplatin 
(a,c,f) or mitomycin C (b,d,g). Percentage of growth inhibition was 
calculated by comparing the number of cells treated with drug to the 
number of cells cultured in media with DMSO as control. Error bars 
represent standard error of the mean (n = 4). RH0XF2 depletion in 
NCI-H1299 (e) and SW1353 cells (h). Total cell extracts for each cell line 
were used to detect the presence of RH0XF2 by western blotting. 
Expression level was compared to the alpha tubulin as loading control. 

Additional file 13: RH0XF2 knocked down in K562 cells. (a,b) Cell 
lines in which RH0XF2 and GFP were knocked down were grown in the 
presence of 35 \xW\ cisplatin (a) or 7 |jM of mitomycin C (b) for 2 days. 
Total cell extracts for each cell line were used to detect the presence of 
RH0XF2 by western blotting. Expression level was compared to alpha 
tubulin as loading control, (c) Total cell extracts from a RH0XF2-expressing 
cell line and a cell line with GFP knocked down but overexpressing RH0XF2 
were used to detect the presence of RH0XF2 by western blotting. 
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